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The topic treated is that of finding a reproducible, plausible and computationally simple method of 
selecting a discrete frequency distribution with a prescribed ranking of its components. The problem 
is shown to be tractable when a minimax error selection criterion is employed, and "error" is measured 
by maximum absolute deviation between components. The vertices of the polyhedron of optimal solu- 
tions can also be found explicitly, and so their centroid can be calculated if unique specification is 
required. 
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1 . Introduction 

In the mathematical modeling efforts associated with an operations research study, one may 
well have only incomplete information on which to base a representation of the probabilities of 
the various outcomes of some pertinent chance event. Under these circumstances, one should of 
course examine the consequences of several alternative probability distributions, each consistent 
with the information at hand. It still seems desirable, however, to have a systematic and repro- 
ducible method for arriving at a single "nominal" distribution, to serve as a base-point for such 
sensitivity analyses. 

This note works out the mathematics of one approach, based on a "minimax error" criterion, 
to the selection of a nominal distribution. The "incomplete information" is assumed to consist 
of a ranking of the individual terms of the probability distribution. 1 A previous paper 2 considered 
the case in which the given information consists of prescribed upper and lower bounds on these 
terms. 

Let P be the polyhedron of real ra-vectors x (where n > 1) whose components x\ satisfy 

O^x, ^* 2 ^ . . . ^x n , (1.1) 

£i*i = l. (1.2) 

Our objective is to choose xeP to minimize 

F(x)= max {rf(x, y):yeP}, (1.3) 

where d is the metric on n -space given by 

,/(x, y)= max,- \xi — y,\. (1.4) 



1 Study of this situation was suggested by J. McLynn of Davidson, Talbird and McLynn, Inc. 

1 A. J. Goldman and P. R. Meyers, Minimax error selection of a discrete univariate distribution with prescribed componentwise bounds, J. Res. NBS 72B (Math. 
Sci.) No. 4 263-271 (1968). 
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Other metrics may be investigated subsequently. 

The following section accomplishes the explicit evaluation of F(x) in highly tractable form. 
The minimization of F(x) over P, which begins section 3, is then trivial. It turns out, however, that 
for n > 2 there will be a convex polyhedron of optimal x's rather than a single one; section 3 goes 
on to propose the centroid of this polyhedron's vertices as a plausible "representative" choice, and 
includes a determination of both vertices and centroid. 

2. Evaluation of F 

The particular metric (1.4) has the pleasant property that F(x) —for any compact set P, not 
only the one defined by (1.1) and (1.2) — can readily be evaluated in terms of the quantities 

Mi=max {ynyeP}, (2.1) 

mi = min {yr.yeP}. (2.2) 

Indeed, we have 

F(x)= max,, max,- max (y } — *;, xt — yt) 

= max, max {max y (y/ — *;), max,, fe — y*)}, 

or finally 

F(x)=ma.Xj max {Mi — xu X{ — nn}. (2.3) 

For the particular polyhedron P which figures here, it is easily verified that 

Mf= l/(7i+ l-i) (l^i^n), (2.4) 

m = (l^i<n), (2.5) 

m n =l/n. (2.6) 

Thus we have 

F(x)= max {max,{l/(/i-h 1 — /) — *,}, x n -i, x n — l/n}. 

The term corresponding to i = n is 

1 X n = 2*? — Xj ^ #n-l , 

so we can replace the last equation by 

F(x)= max {max, {l/(rc + 1 — i) — Xi}, x n — l/n}. (2.7) 

Next, suppose the term with i = n—l yields the maximum in (2.7). Then we should have 

l/2-Xn-i^Xn-lln, (2.8) 

l/2-Xn-i^ l-Xn. (2.9) 

Addition yields 
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l — 2x tl -\ ** 1 — 1M, 

which is equivalent to x n -\ ^ l/2/i. Thus x, ^ 1/2// for 1 ^ / < // — 1, and so 

1 — Xn-i—x n = %¥~ 2 Xi ^ (n — 2)l2n. 

This and (2.8) imply 

(n + 2)l2n-x n -i ^x tt =Un+(x„- 1///)^ I/// + (l/2-x n -,)= (// . + 2)l2n-x n -i- (2.10) 

Since equality holds in (2.10), it must hold throughout the preceding sequence of equalities. In 
particular it holds in (2.8) and (2.9), so that the maximum in (2.7) does not occur uniquely for the 
term with i=n — 1, and hence this term can be deleted from (2.7): 

F(x)= max {max;<„_i{l/(//-h 1 — i)— xt}, l—x„, x„ — l//i}. (2.11) 

For // = 2, (2.11) gives 

F(x)= max {1 -x 2 , * 2 - 1/2}. (2.12) 

We next assume // ^ 3. 

Now suppose the maximum in (2.11) holds for the term corresponding to some /'<//— 1. Then 
we should have 

l/(//+ 1 -l)-Xi^ 1 -*„, (2.13) 

ll(n+l-i)-xt*x H -lln. (2.14) 



Addition yields 



or equivalently 



2l(n + l-i)-2xi2* 1-1/n, 



2x,-^2/(/i + l-i)+l//i-l 
= [2// + (// -f 1 - i)-n(n + 1 + /)]///(// + 1 - i) 
= [1 + 2// - ri 2 + (// - l)/j//i(/? H- 1 - f) 
= [2 - (n - 1) 2 + (n -l)i\/n(n + 1 - i) 
= [2 - (// - 1) (n - 1 - £)]/7i(n 4- 1 - i). 
Since x; ^ 0, we must have 

2^ (n-l)(n-l-i). (2.15) 

But // — 1 5? 2 and // — i > 1. Thus this situation is impossible for n > 3, i.e.. 

F(x) = max {l-x tt , x fl -l/n} (n>3). (2.16) 
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And it is possible for ra = 3 only when i = Ji — 2 = 1, in which case equality holds in (2.15), hence 
throughout the preceding inequalities, so that the maximum in (2.11) does not hold uniquely for 
the term corresponding to i= 1. We have now shown that, for all n > 1, 

F(x) = max {!-*», x n - l/n}. (2.17) 

3. Determination of Optimal Distributions 

The minimization of F(x) over P is trivial in view of (2.17), which can be rewritten 

F(x) = l-x n for0^x w ^ U+1)/2ai, 

(3.1) 
F{-x)=x n -lln for (n-hl)/2/i ^x n ^ 1. 

We see at once that 

F min =(n-l)/2n, (3.2) 

and that the minimizing xeP are precisely those with 

x n =(n+l)l2n. (3.3) 

One such x is given by 

x, = 1/2/1 forl^i^Ai-1. (3.4) 

But this choice is somewhat arbitrary, and moreover honors the original ranking (for x\, . . ., 
x n -i) only in a technical sense. In view of the original intention to fasten on a single x, it seems (to 
the writer) less arbitrary to select the centroid of the vertices of the polyhedron of F — minimizing 
xeP. This centroid is obtained by adjoining an nth component (n+ l)/2/i, as in (3.3), to the centroid 
of the polyhedron Q in (n— 1) -space consisting of those z with 

O^z,^ ... *s=zn_i^ (n+l)l2n, (3.5) 

2f- 1 z f =l- (n+l)l2n= (n-l)/2n. (3.6) 

Note that in view of (3.6), the last inequality in (3.5) can be dropped. 

To determine the vertices of Q, note that a nonsingular linear transformation preserves 
extreme points of polyhedra. This applies in particular to the transformation associated with the 
substitutions 

U\=Z U 

Ui = Zi — Zi-\ ( 2 ^ i ^ n — 1 ) , 

which convert Q into the polyhedron Q' defined in u-space by 

m^O (l^f^n-1), (3.7) 

2f-?(n-i)u i =(n-l)l2n. (3.8) 

Q' consists of the intersection of the hyperplane //, denned by (3.8), with the nonnegative orthant. 
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Thus its extreme points are the intersections of H with the (n—\) positive coordinate axes, i.e., 
the (n — 1 ) points u ( ^ given by 

w^ = 8/j ( /i - I )/2/i(n --./). (3.9) 

The corresponding extreme points z { J ] of Q are given by 

zU>=o (Ui^i), 

Thus the centroid z* of Q has coordinates 

z < *=(ii-l)- 1 ^ 8 i(ii-l)/2ii(ii-y) = (l/2ii)^(it-j)- 1 , (3.10) 

which unlike (3.4) are strictly increasing in i. 

We conclude by pointing out two somewhat disquieting features of the solution obtained above. 
The first [see (3.3)] is that the chance-event outcome ranked most probable is assigned a probability 
greater than 1/2, though only slightly so for large n. The second [see (3.2)] is the size of F min . . . 
nearly 1/2 for large n. But perhaps this should not be surprising in view of the extremely conserva- 
tive nature of minimax decision criteria. 

(Paper 72B4-277) 
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